Position Heaps for Permuted Pattern Matching on Multi-Track String
نویسندگان
چکیده
A multi-set of N strings of length n is called a multi-track string. The permuted pattern matching is the problem that given two multi-track strings T = {t1, . . . , tN} of length n and P = {p1, . . . , pN} of length m, outputs all positions i such that {p1, . . . , pN} = {t1[i : i+m−1], . . . , tN [i : i+m−1]}We propose two new indexing structures for multi-track stings. One is a time-efficient structure for T that needs O(nN) space and enables us to solve the problem in O(mN+occ) time, where occ is the number of occurrences of the pattern P in the text T. The other is memory-efficient, it requires only O(n) space, whereas the matching consumes O(mN + occ) time. We show that both of them can be constructed in O(nN) time.
منابع مشابه
KMP Based Pattern Matching Algorithms for Multi-Track Strings
Multi-track string is an N -tuple strings of length n. For two multi-track strings T = (t1, t2, . . . , tN ) of length n and P = (p1, p2, ..., pM ) of length m, permuted pattern matching is a problem to find all positions i such that P is permuted match with T[i : i+M ]. We propose three new algorithms for permuted pattern matching based on the KMP algorithm. The first algorithm is an exact mat...
متن کاملTransposition Invariant Pattern Matching for Multi-Track Strings
We consider the problem of multi-track string matching. The task is to find the occurrences of a pattern across parallel strings. Given an alphabet Σ of natural numbers and a set S over Σ of h strings si = s1 · · · s i n for i = 1, . . . , h, a pattern p = p1 · · · pm has such an occurrence at position j of S if p1 = s i1 j , p2 = s i2 j+1, . . . , pm = s im j+m−1 holds for i1, . . . , im ∈ {1,...
متن کاملOn-Line Construction of Position Heaps
We propose a simple linear-time on-line algorithm for constructing a position heap for a string [EMOW11]. Our definition of position heap differs slightly from the one proposed in [EMOW11] in that it considers the suffixes ordered in the descending order of length. Our construction is based on classic suffix pointers and resembles Ukkonen’s algorithm for suffix trees [Ukk95]. Using suffix point...
متن کاملA multi-pattern hash-binary hybrid algorithm for URL matching in the HTTP protocol
In this paper, based on our previous multi-pattern uniform resource locator (URL) binary-matching algorithm called HEM, we propose an improved multi-pattern matching algorithm called MH that is based on hash tables and binary tables. The MH algorithm can be applied to the fields of network security, data analysis, load balancing, cloud robotic communications, and so on-all of which require stri...
متن کاملA Two-layer Approach for Multi-track Segmentation of Symbolic Music
Music segmentation is a key issue in music information retrieval (MIR). Structural information about a composition achieved by music segmentation can improve several tasks related to MIR such as searching and browsing large music collections, visualizing musical structure, lyric alignment, and music summarization. Various approaches have been introduced to MIR, many of them recently concentrati...
متن کامل